Annotating the functional chunks in Chinese sentences
Abstract
This paper proposes a new syntactic annotation scheme, the functional chunk, which aims to represent the grammatical relations between sentence-level predicates and their arguments. Under this scheme, we built a Chinese chunk bank of about two million Chinese characters and developed learned models for automatically annotating new text with functional chunks. We also propose a two-stage approach to building a Chinese treebank on top of the chunk bank, and report experimental results from a chunk-based syntactic parser that show how functional chunks improve parsing performance. This work lays a good foundation for a further research project to build a large-scale Chinese treebank.
1 Elliott F. Drabek has been a Ph.D. candidate at Johns Hopkins University since Sept. 2001.
Similar resources
Word Alignment of English-Chinese Bilingual Corpus Based on Chunks
In this paper, a chunk-based method for word alignment of an English-Chinese corpus is proposed. The chunks of the English sentences are identified first. Then the chunk boundaries of the Chinese sentences are predicted from the translations of the English chunks and heuristic information. Ambiguities in the Chinese chunk boundaries are resolved by the coterminous words in the English chunks. With the chu...
SemEval-2010 Task 11: Event Detection in Chinese News Sentences
The goal of the task is to detect and analyze the event contents in real world Chinese news texts. It consists of finding key verbs or verb phrases to describe these events in the Chinese sentences after word segmentation and part-of-speech tagging, selecting suitable situation descriptions for them, and anchoring different situation arguments with suitable syntactic chunks in the sentence. Thr...
Semantic Labeling of Chinese Serial Verb Sentences Based on Feature Structure
Parsing Chinese serial verb sentences is a key issue in NLP. Many controversies arise from serial verb sentences. This paper puts forward a novel model “the Feature Structure theory” to resolve the semantic labeling of Chinese serial verb sentences. We analyze the difficulties in annotating these sentences, and compare Feature Structure with traditional dependency structure. Feature Structure r...
Chunk-Level Reordering of Source Language Sentences with Automatically Learned Rules for Statistical Machine Translation
In this paper, we describe a source-side reordering method based on syntactic chunks for phrase-based statistical machine translation. First, we shallow parse the source language sentences. Then, reordering rules are automatically learned from source-side chunks and word alignments. During translation, the rules are used to generate a reordering lattice for each sentence. Experimental results ar...
A Language Learning System with Automatic Feedback: An Application Based on a English-Chinese Parallel Corpus
Learners of English as a second language generally need extensive practice in writing English, identification of the mistakes in their English, and feedback hints on how to correct those mistakes. A computer-assisted online system is designed to address these issues in the context of learning from a parallel English-Chinese corpus of New York Times news articles. In the system, students...